AMENDMENTS TO THE CLAIMS

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

CLAIMS 

1. (Original) A method of back-off modelling for use in named entity recognition
of a text, comprising, for an initial pattern entry from the text:

relaxing one or more constraints of the initial pattern entry;
determining if the pattern entry after constraint relaxation has a valid form; and
moving iteratively up the semantic hierarchy of the constraint if the pattern entry
after constraint relaxation is determined not to have a valid form.

2. (Original) A method according to claim 1, wherein moving iteratively up the
semantic hierarchy of the constraint if the pattern entry after constraint relaxation is
determined not to have a valid form comprises:

moving up the semantic hierarchy of the constraint;
relaxing the constraint further; and
returning to determining if the pattern entry after constraint relaxation has a valid
form.

3. (Currently Amended) A method according to claim 1 [[or 2]], further comprising:

determining if a constraint in the pattern entry, after relaxation, also has a valid form;
and
moving iteratively up the semantic hierarchy of the constraint if the constraint in the
pattern entry after constraint relaxation is determined not to have a valid form.

4. (Original) A method according to claim 3, wherein moving iteratively up the
semantic hierarchy of the constraint if the constraint in the pattern entry after constraint
relaxation is determined not to have a valid form comprises:

moving up the semantic hierarchy of the constraint;
relaxing the constraint further; and
returning to determining if a constraint in the pattern entry after constraint relaxation
has a valid form.

5. (Currently Amended) A method according to [[any one of the preceding claims]] claim
1, wherein if a constraint is relaxed, the constraint is dropped entirely from the pattern entry
if the relaxation reaches the root of the semantic hierarchy.

6. (Currently Amended) A method according to [[any one of the preceding claims]] claim
1, further comprising terminating if a near optimal frequently occurring pattern entry is
reached to replace the initial pattern entry.

7. (Currently Amended) A method according to [[any one of the preceding claims]] claim
1, further comprising selecting the initial pattern entry for back-off modelling if it is not a
frequently occurring pattern entry in a lexicon.
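
By way of illustration only, and not as part of the claim language, the back-off steps recited in claims 1 to 7 might be sketched in Python as follows. The hierarchy table `PARENT`, the `ROOT` marker, the dictionary representation of a pattern entry, the `is_valid` callback and the fixed order in which constraints are relaxed are all assumptions made for this sketch; the claims do not prescribe any of them.

```python
# Hypothetical semantic hierarchy: each value maps to its more general parent.
ROOT = "ROOT"
PARENT = {
    "Mr.": "PersonPrefix",
    "PersonPrefix": "TriggerWord",
    "TriggerWord": ROOT,
}


def key(entry):
    """Hashable form of a pattern entry (a dict of constraint name -> value)."""
    return frozenset(entry.items())


def back_off(entry, lexicon, is_valid):
    """Relax constraints of `entry` until a valid, frequent pattern entry is reached.

    `lexicon` is a set of keys of frequently occurring pattern entries and
    `is_valid` is a caller-supplied validity check (both are assumptions).
    """
    current = dict(entry)
    # Back-off is only needed when the entry is not already frequent.
    if key(current) in lexicon:
        return current
    for name in list(entry):  # assumed relaxation order: insertion order
        while name in current:
            parent = PARENT.get(current[name], ROOT)
            if parent == ROOT:
                # Relaxation reached the root: drop the constraint entirely.
                del current[name]
            else:
                # Move one step up the semantic hierarchy.
                current[name] = parent
            # Terminate once a valid, frequently occurring entry is reached.
            if is_valid(current) and key(current) in lexicon:
                return current
    return current


entry = {"trigger": "Mr.", "shape": "InitialCap"}
lexicon = {key({"trigger": "PersonPrefix", "shape": "InitialCap"})}
result = back_off(entry, lexicon, is_valid=lambda e: "shape" in e)
print(result)  # {'trigger': 'PersonPrefix', 'shape': 'InitialCap'}
```

In this sketch a single step up the hierarchy already yields a frequent entry; with a sparser lexicon the loop keeps generalising a constraint until it is dropped, and then moves on to the next constraint.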

8. (Original) A method of inducing patterns in a pattern lexicon comprising a
plurality of initial pattern entries with associated occurrence frequencies, the method
comprising:

identifying one or more initial pattern entries in the lexicon with lower occurrence
frequencies; and
relaxing one or more constraints of individual ones of the identified one or more
initial pattern entries to broaden the coverage of the identified one or more initial pattern
entries.

9. (Original) A method according to claim 8, further comprising creating the
pattern lexicon of initial pattern entries from a training corpus.

10. (Currently Amended) A method according to claim 8 [[or 9]], further comprising
merging individual ones of the constraint relaxed initial pattern entries with similar pattern
entries in the lexicon to form a more compact pattern lexicon.

11. (Currently Amended) A method according to claim 9 or 10, wherein the entries in
the compact pattern lexicon are generalised as much as possible within a given similarity
threshold.



12. (Currently Amended) A method according to [[any one of claims 8 to 11]] claim 8,
further comprising:

determining if the pattern entry after constraint relaxation has a valid form; and
moving iteratively up the semantic hierarchy of the constraint if the pattern entry
after constraint relaxation is determined not to have a valid form.

13. (Original) A method according to claim 12, wherein moving iteratively up the
semantic hierarchy of the constraint if the pattern entry after constraint relaxation is
determined not to have a valid form comprises:

moving up the semantic hierarchy of the constraint;
relaxing the constraint further; and
returning to determining if the pattern entry after constraint relaxation has a valid
form.



14. (Currently Amended) A method according to claim 12 [[or 13]], further comprising:

determining if a constraint in the pattern entry, after relaxation, also has a valid form;
and
moving iteratively up the semantic hierarchy of the constraint if the constraint in the
pattern entry after constraint relaxation is determined not to have a valid form.

15. (Original) A method according to claim 14, wherein moving iteratively up the
semantic hierarchy of the constraint if the constraint in the pattern entry after constraint
relaxation is determined not to have a valid form comprises:

moving up the semantic hierarchy of the constraint;
relaxing the constraint further; and
returning to determining if a constraint in the pattern entry after constraint relaxation
has a valid form.
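
By way of illustration only, and not as part of the claim language, the pattern induction recited in claims 8 to 10 might be sketched in Python as follows. The tuple representation of a pattern entry, the `min_count` threshold used to identify entries with lower occurrence frequencies, and the `drop_last_constraint` relaxation (which simply removes the last constraint rather than moving up a semantic hierarchy) are assumptions made for this sketch.

```python
from collections import Counter


def drop_last_constraint(pattern):
    """Hypothetical relaxation: broaden a pattern by removing its last constraint."""
    return pattern[:-1]


def induce(lexicon, min_count, relax=drop_last_constraint):
    """Relax low-frequency pattern entries and merge them with similar entries.

    `lexicon` maps pattern tuples to occurrence frequencies. Entries whose
    frequency is below `min_count` are relaxed; the relaxed entry is merged
    with any identical entry already present, giving a more compact lexicon.
    """
    result = Counter(lexicon)
    changed = True
    while changed:
        changed = False
        # Visit the least frequent entries first.
        for pattern, count in sorted(result.items(), key=lambda kv: kv[1]):
            if count >= min_count or not pattern:
                continue  # frequent enough, or nothing left to relax
            broader = relax(pattern)
            del result[pattern]
            result[broader] += count  # merge with a similar entry if present
            changed = True
            break  # the lexicon changed, so re-scan it
    return result


lexicon = Counter({
    ("InitialCap", "Mr."): 1,
    ("InitialCap", "Dr."): 1,
    ("AllCaps",): 5,
})
compact = induce(lexicon, min_count=2)
print(dict(compact))
```

Here the two rare entries are both generalised to `("InitialCap",)` and merged, so the resulting lexicon holds two entries instead of three.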



16. (Currently Amended) A decoding process in a rich feature space comprising a
method according to [[any one of claims 1 to 7]] claim 1.

17. (Currently Amended) A training process in a rich feature space comprising a method
according to [[any one of claims 8 to 15]] claim 8.

18. (Original) A system for recognising and classifying named entities within a text,
comprising:

feature extraction means for extracting various features from a document;
recognition kernel means to recognise and classify named entities using a Hidden
Markov Model; and
back-off modelling means for back-off modelling by constraint relaxation to deal
with data sparseness in a rich feature space.

19. (Currently Amended) A system according to claim 18, wherein the back-off
modelling means is operable to provide a method of back-off modelling according to
[[any one of claims 1 to 7]] claim 1.

20. (Currently Amended) A system according to claim 18 [[or 19]], further comprising a
pattern induction means for inducing frequently occurring patterns.

21. (Currently Amended) A system according to claim 20, wherein the pattern induction
means is operable to provide a method of inducing patterns according to [[any one of claims 8
to 15]] claim 8.

22. (Currently Amended) A system according to [[any one of claims 18 to 21]] claim 18,
wherein said various features are extracted from words within the text and the discourse of
the text, and comprise one or more of:

[[a.]] a) deterministic features of words, including capitalisation or digitalisation;

[[b.]] b) semantic features of trigger words;

[[c.]] c) gazetteer features, which determine whether and how the current word
string appears in a gazetteer list;

[[d.]] d) discourse features, which deal with the phenomena of name alias; and
the words themselves.

23. A feature set for use in back-off modelling in a Hidden Markov Model, during 
named entity recognition, wherein the feature sets are arranged hierarchically to allow for 
data sparseness. 